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Abstract 

The Hopfield-Tank (1985) recurrent neural network architecture for the Travehng Salesman Prob- 
lem is generalized to a fully interconnected "cellular" neural network of regular oscillators. Tours 
are defined by synchronization patterns, allowing the simultaneous representation of all cyclic per- 
mutations of a given tour. The network converges to local optima some of which correspond to 
shortest-distance tours, as can be shown analytically in a stationary phase approximation. Simu- 
lated annealing is required for global optimization, but the stochastic element might be replaced 
by chaotic intermittency in a further generalization of the architecture to a network of chaotic 
oscillators. 
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An artificial neural network with a symmetric connection allowed between 
every pair of "neuronal" units evolves to a state that satisfies an optimization 
principle expressed in terms of the weights defining those connections. In a 
Hopfield associative memory network^, the optimization principle is satisfied 
when the network settles to one of many pre-stored patterns, depending on 
the initial state. In the Hopfield-Tank^ construction, weights were chosen so 
that the stable fixed points of the network correspond to shortest-distance tours 
among a set of cities arbitrarily laid out on a map, offering a potential solution 
to the algorithmically difficult Travelling Salesman Problem (TSP). A stochastic 
element was introduced in a "simulated annealing" scheme so that the network 
would escape local optima, such as those corresponding to good tours that are 
not necessarily shortest. Here, the Hopfield-Tank scheme is generalized to a 
"cellular neural network" in which the units are dynamical systems that oscillate 
in the absence of input from other units, arguably resembling real neurons a bit 
more closely. The representation of the TSP turns out to be somewhat more 
natural in such a network, where tours are defined in terms of synchronization 
patterns among the oscillating units. Additionally, intermittent synchronization 
might naturally fill the role of the stochastic element in the original scheme. 
In this work we construct a network of regular oscillators that may solve the 
TSP about as well as the original Hopfield-Tank network, as preparation for an 
extension to a network of chaotic oscillators. 

I. INTRODUCTION 

The Hopfield-Tank neural network solution to the Travelling Salesman Problem was a 
milestone in neuromorphic computation. While the proposed architecture provided a rather 
poor TSP solution, finding only locally optimal paths for problems of touring more than a 
few cities, the approach supported the use of more successful "neural" solutions to simpler 
problems. In the associative memory problem, one simply seeks the closest local optimum; 
the issue of avoiding such solutions in favor of a global optimum does not arise. That a neural 
solution to the TSP exists at all still speaks to the power of the neuromorphic approach. 

In more recent years the suggestion that the neuronal units of the old-style networks ought 
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to be replaced by oscillating dynamical systems has become popular-. The oscillations are 
analogous to spike trains in real neurons. The resulting cellular neural networks (CNNs) 
arguably resemble biological systems more closely than their older counterparts, in which a 
"neuron" exists in a fixed state and outputs a pre-defined, approximately binary function 
of a collection of inputs. Attempts to capture the properties of biological systems in simple 
models, such as the Grossberg-Mingolla model of early vision^, indeed depend on neuronal 
dynamics. Further, in the realm of practical application, CNN architectures lend themselves 
to implementation in analog hardware. 

A CNN generalization of the Hopfield associative memory network has recently been 
proposed^'^. Some effort was required to stabilize the desired retrieval state, as compared to 
the original Hopfield network. The resulting network performed almost as well as its fixed- 
state counterpart, a finding that was taken to imply a broad utility for the CNN paradigm. 

Relationships between oscillating "cellular" neurons offer richer possibilities for represen- 
tation of solutions and of intermediate computational steps. Synchronization of two or more 
neurons (i.e. phase-locking) was the defining relationship in the proposed CNN associative 
memory architecture. More generally, synchronization offers a representation of binding that 
may be useful for image segmentation and other combinatorial problems^. Such applications 
are in line with the suggested role of synchronization in perceptual grouping in biological 
systems^i^ii^. We show here that synchronization in a CNN provides a particularly natural 
representation of the TSP, yielding a solution that generalizes the older Hopfield method. 
In future work, we will examine the use of synchronization, as an intrinsically loose form of 
binding, to avoid the problem of local optima. 

We first summarize the original Hopfield solution in the next section, and then describe 
the CNN generalization in Section 3. Results are presented in Section 4, where the essential 
role of stochasticity in a simulated annealing scheme is also demonstrated. The stochastic 
version leads to a suggestion in the concluding section that the instability that was seen 
as an obstacle to the design of CNN's for associative memory may be beneficial in CNN's 
designed for optimization. 
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II. BACKGROUND: THE HOPFIELD TSP SOLUTION 



The Hopfield-Tank network is a traditional neural network, fully interconnected, with 
fixed weights chosen so that the globally optimal state corresponds to a shortest-distance 
"tour" among a collection of n cities with a pre-specified distance for each pair of cities. In 
this original representation, one considers an n x n matrix of binary values, where the rows 
correspond to cities and the columns correspond to slots, 1 through n, in the tour schedule. 
A tour is any pattern of O's and I's, such that there is exacly one 1 in each row (exactly 
one city visited at a time) and one 1 in each column (each city visited exactly once). For 
instance the tour depicted in Fig. [T^ is ECABD. For the 5-city problem, there is a 10-fold 
degeneracy in optimal patterns (shortest-distance cyclic tours) due to arbitrariness in the 
selection of the starting city and the direction of the tour. The travelling salesman problem, 
in this representation or any other, is difficult because of the multiplicity of local optima. 

The Hopfield and Tank network converges in a few neural time constants for small n, 
but the required time grows rapidly with n and sub-optimal solutions are typically found. 
The nonlinear response function of each unit is key to the operation of the network. Flex- 
ibility comes from embedding a discrete problem in a continuous decision space. That 
is, each unit i generates a nearly, but not quite binary output gmoid function 

V{ui) = |(1 + tanh(uj/Mo)) of a continuous valued input Uj. Consider a general, fully 
interconnected network of units (here A^ = rP), where each unit is updated according to: 



where Tij is a symmetric matrix defining the connections among the units in the network, 
the /j are biases, and r is a decay time. It can be shown that an energy function 



is locally minimized, by computing the time derivative dE/dt and noting that the resulting 
form is negative semi-definite and that E is bounded below. In the limit of high gain (small 
Uo), for which the sigmoid function is infinitely steep, the minima only occur when each Vi 
is or 1, i.e. at the corners of the A^-dimensional hypercube [0, 1]^ that is the state space 
of the network described in terms of the variables Vi. 




(1) 




(2) 
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The class of optimization problems that can be formulated in this manner is quite large, 
as illustrated by the Travelling Salesman Problem, which is a priori different in form. The 
TSP energy is 

X i j^i i X Y^X 

. X i 

+ Y E E E dxYVx^iVY,+l + (3) 
X YytX i 

where the summation indices X and Y range over the n cities, and the indices i and j range 
over the n slots in the schedule. The first term in ([3]) inhibits multiple I's in each row, the 
second term inhibits multiple I's in each column, the third term tends to force the total 
number of I's in the matrix to be exactly n, and the last term minimizes the total distance 
associated with any pattern of O's and I's that defines a valid tour. 

The network method of optimization succeeds largely because difficult decisions between 
similar tours can effectively be postponed until the end of a calculation. In the interim, 
non-tour patterns (corresponding to multiple visits to a city or visits to more than one city 
at a time) can be regarded as representing alternative choices simultaneously. Choices can 
thus be winowed incrementally toward a true optimum. Nonetheless, the network tends to 
converge to local optima, rather than the global optimum, especially as n becomes large. 

It was found that global optimization performance could be greatly improved by a "sim- 
ulated annealing" technique in which a stochastic component of gradually decreasing am- 
plitude is added to the dynamics. In the original Hopfield annealing scheme, the variables 
Vi were reinterpreted as expectation values of a statistical distribution of "spin states" , and 
a lowering of "temperature" was effected by increasing the "gain" in the sigmoid function 
that determines Vi, i.e. decreasing Uo- The stochastic component can also be represented 
directly as a noise term in the dynamical equations. 

III. A 'CELLULAR' GENERALIZATION OF THE HOPFIELD SOLUTION 

The representation considered here is an n x array of coupled periodic oscillators. A 
tour is specified by a synchronization pattern in which all oscillators in each row and each 
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FIG. 1: In the original Hopfield representation (a), a tour (here ECABD) is specified by the 
collection of units that are "on", provided there is only one "on" unit in each row and only one 
in each column. In the CNN representation (b) equivalent cyclic permutations of the same tour 
are simultaneously represented as sets of 5 units that are synchronized, with an analogous proviso. 
Units with the same relative phase are shown in the same color. (Dash marks distinguish ambiguous 
colors in the grayscale version.) 

column are desynchronized, and such that for each oscillator in each column (row), there is 
exactly one oscillator in each other column (row) that is synchronized with it. In Fig. [T]d, 
the same tour ECABD is depicted in the new representation, simultaneously with equivalant 
tours given by cyclic permutations of cities: CABDE, ABDEC, etc. There is now only a 
two-fold degeneracy in optimal patterns, due to arbitrariness in direction. 

To solve the travelling salesman problem in the representation we have described, let 



each oscillator be given by a complex number z, 



1,2, 



A,B,...Xn) that 



contains both a phase arg(zjj) and an amplitude \zij\. Assume all oscillators have the same 
frequency u and make the replacement Zij exp{—iujt)zij, so that only the relative phases 
are represented in the complex quantities Zij. The Lyapunov function we seek to minimize 
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is: 



Zij\ 



Zij' 



-DEE 



I ^i'j I 



'j+i,j' 



where asterisks denote complex conjugates, and we close the tours by defining -Zn+ij 



(4) 



The first term, with coefficient A, tends to force \zij \ = 1. The second term, with coefficient 
B tends to force each Zij to one of n phase states, corresponding to the nth roots of unity. 
The terms with coefficients C and D penalize for synchronization within each row and within 
each column, respectively. The last term, expressed in terms of distances djji between cities 
j and j', tends to a minimum when each partial sum Y^'^y^'^ over synchronized oscillators 
at i,j and i + (i.e. for which arg^Zij) = arg(2j+ij/)), is minimized, that is 

when the specified tour has shortest distance, since it is expected that such partial sums 
will dominate the total sum. 

If the oscillators are governed by equations 

dL 



dz*j 



dL 

dzii 



(5) 



following Hoppensteadt^^, then the derivative of the Lyapunov function L = 
% + i^L/^^ij) Kj\ — Yliij ^ 0) so that the cost function is monoton- 
ically decreasing and must reach at least a local minimum, since L can be easily seen to be 
bounded below. 

For sufficiently large B, the units are each forced to one of the n allowed phase states. 
For the restricted configurations thus defined, it is easily seen that the sum of the terms 
in (jlj) with coefficients C and D is minimized when the n phases in any column or any 
row are all different. Such configurations define a set of n tours, some or all of which 
may be equivalent via cyclic permutation. Each such tour defines a set of units 
with the same relative phases, giving a contribution to the distance term in (jlj) equal to 
E E^ dj(^)j(i+i)Re{ i^;^^!;!, ) = smce arg(2;,j(,)) = arg(z,+ij(i+i)) for 



synchronized units. Each of the n such contributions is minimized when each tour length 
J2i is minimized. Additional contributions from cross terms between desynchro- 

nized units can be neglected in a stationary phase approximation. Coefficients A, B,C, D, E 
can then be chosen so that the various optimization criteria, forcing select phase states, 
forcing tour configurations, and minimizing distance, can indeed be satisfied simultaneously. 
A formal proof is given in Appendix A. 
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FIG. 2: Map, to scale, of the five cities defining the distances djj' used for all 5-city TSP simulations, 
showing the shortest tour, ACBED (solid line), and the second-shortest tour, ABECD (dashed 
line)^ 
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IV. RESULTS 



A. The n-phase CNN 



The dynamical equations derived from 



and are: 



dt 



2 



3' 



Ziji 



\Zij'\ 



Zilj 



I ^i'i I 



4 V u%i V 



Zi+l,j' 
\Zi+l,j' 



\Zi-l,j' 



^ij I ^ij I 



+ 



(6) 



A network governed by ([6]) almost never reaches a global minimum, or even a state 
defining a tour. For the set of pre-specified distances computed from the map in Fig. [2], the 
network reached a phase state shown in Table [H in a typical run. However, if gaussian noise 
is added to the dynamics at periodic intervals: 



z{t 



zmt) 



(7) 



whenever t = kr for some integer k and where ^ is a random complex number, then the 
system can be made to attain the desired state, provided that the amplitude of the noise 
in the phase of ^ (an analog "temperature") is decreased slowly enough. We used ^{kr) = 
pexp(i6fc), where the magnitude p is uniformly distributed between 0.7 and 1.3, and the 
random variable in the phase is a unit gaussian deviate with standard deviation af, that 
steadily decreases over time, i.e. a® = acrf-i^ starting from a value cxo = 4 that effectively 
randomizes the initial phase perturbations. Resulting states such as the one shown in Table 
HI] are indeed tours. Typical convergence histories are shown in Fig. [31 The tour state in 
Table [III corresponds to the shortest-distance tour among the 12 distinct tours shown in 
Table IIII| for the table of distances that was used to construct the network. This direct 
simulated annealing method is effective in selecting one of a small number of global optima 



TABLE I: Relative phases (from — vr to vr) of oscillators in the 5x5 CNN representation of the 5-city 
travelling salesman problem, after reaching a steady state, with coefficients A = 0.5, B = 0.08, 
C = D = 4.0, E = 0.4. The dynamical equations ([U]) were integrated using a finite difference 
scheme with time step At = 0.01. No tours are evident. 







slot 


in schedule 






1 


2 


3 


4 


5 


A 


-2.470 


-2.187 


-0.144 


1.284 


1.286 


B 


2.496 


-1.221 


1.404 


-2.484 


-0.012 


city C 


1.240 


2.643 


-2.619 


0.009 


-1.221 


D 


0.004 


1.088 


-2.637 


-1.271 


2.533 


E 


-1.292 


1.112 


0.049 


2.519 


-2.530 



(precisely 2x5! = 240, since there are 5! assignments of phase states to synchronized subsets, 
and two overall choices of direction) out of 5^^ network states. Locally optimal states are 
sometimes selected for a given annealing schedule, but the procedure appears to converge to 
a perfect one as the annealing rate is lowered. That is, as shown in Table IIIH the network 
appears more likely to converge to the shortest-distance tour as the analog "temperature" 
is lowered more slowly. 

The highly irregular convergence that is seen is to be expected as the system randomly 
jumps among basins of attraction corresponding to different local optima as the temperature 
is lowered. Only some of these optima correspond to tours. The more slowly the temperature 
is lowered, the greater the chance that the system will reach a "deep" global optimum, 
corresponding to a shortest-distance tour, and have insufficient energy to escape at that 
point in the process. The convergence pattern is typical of simulated annealing, as reported 
for instance by Kawabe et alM. 

Varying the different coefficients affects the network in a way similar to the variation 
of corresponding coefficients in the original fixed-state network. The largest effect comes 
from varying E, which multiplies the distance term, but increasing E tends to force the 
network toward non-tour states. Increasing E fourfold to = 1.6 tended to favor shorter- 
distance tours among the runs that converged to tours, as shown in the table, but most runs 
converged to non-tour states. No tours were obtained in 20 runs with a tenfold increase of 
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TABLE II: Relative phases as in Table HI but with noise of steadily decreasing amplitude included 
in the oscillator dynamics ([7]), using a noise reduction factor a = 0.9999993 every five time steps, 
i.e. at time intervals r = 0.05 = 5At. The tour found, ADEBC, is seen by focussing attention 
on oscillators with relative phase — 27r/5 ~ —1.28 (bold). Permutations of the same tour are 
represented by other collections of oscillators with approximately equal relative phases, e.g. those 
with phase ~ 0, defining the equivalent tour CADEB (italics). The tour class found is indeed the 
shortest-distance tour class for the pre-specified table of distances {djji) computed from Fig. [2j 







slot 


in schedule 
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A 


-1.282 


0.036 


1.240 


2.494 


-2.521 


B 


1.233 


2.485 


-2.500 


-1.320 


-0.016 


city C 


0.034 


1.280 


2.524 


-2.537 


-1.259 


D 


-2.498 


-1.293 


-0.013 


1.195 


2.502 


E 


2.493 


-2.423 


-1.190 


0.046 


1.285 



the coefficient to E = 4.0. 

The statistical significance of the selection of tour states over non-tour states is obvious. 
The observed tendency of the network to select shortest-distance tours is weaker, but still 
statistically significant. Formally, the inverse correlation of the number of occurrances of 
each tour class with tour length is significant at the 95% level for all three cases shown, and 
at the 99% level for the case of slow annealing. The tendency of the network to specifically 
select the shortest tour ACBED, among the twelve tour classes, is significant at the 97% 
level for slow annealing, and at the 99.9% level for fast annealing with E = 1.6. Additional 
details of the significance calculations are provided in Appendix B. The observed trends for 
slower annealing and for increased E add to the significance. 

For a larger number of cities n, preliminary experiments indicate that the architecture 
continues to select tour states over non-tour states and prefers shortest-distance tours. The 
convergence time increases, but is strongly dependent on the configuration of cities and the 
associated "energy" landscape. Typical results for an n = 7 problem are shown in Fig. 
m Parameters are the same as in the n = 5 example, except that the coefficients on the 
terms that force de-synchroniation within the separate rows and columns had to be increased 
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fourfold and the annealing schedule further slowed. In two out of three runs, the network 
converged to a tour state, in about twice the number of iterations required for n = 5. For 
n = 10, with the simple configuration shown in Fig. \5jp, and with no changes in parameters 
as compared to n = 5, the network converged to a globally optimal tour state even more 
quickly than in the n = 5 example (Fig. For other 10-city configurations, the network 
did not converge to any tour or non-tour state, even with vanishing noise level, with the 
same fixed-step numerical scheme used in the other examples. This is to be attributed to 
the complexity of the landscape and the large number of closely spaced local optima. An 
analog implementation, the ultimate target of the initial digital investigation, would obviate 
the numerical issues. 

Distance minimization is achieved in all cases because of the validity of the stationary 
phase approximation, according to which the minimum of the term in the Lyapunov function 
with coefficient E is also the minimum of the partial sum without cross-terms. Convergence 
histories of the total sum in this term for the 5-city example are plotted with the histories 
of the corresponding partial sums in Fig. [3l For the case shown in Fig. [3b, where a non- 
optimal tour is selected, the total sum is also seen to be slightly higher than the minimal 
value attained in the other two panels. The partial and total sums are widely separated. 
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FIG. 3: Network histories for three runs of the cellular neural network with parameters and 
annealing schedules as in Table |TT1 Runs shown are for slow annealing (a) and fast annealing 
(b,c), with randomly chosen initial conditions. Only states (squares) corresponding to tours are 
plotted, with tour lengths as indicated. For each such state, a tour length is also estimated 
(dots) as one-fifth the value of the distance-minimizing part of the Lyapunov function i.e. 



(1/5) E,;E, E,''^,yRe 



that gives an average over the choice of initial city, and 



would be equal to the actual tour length, if cross-terms for which avg{zij) 7^ aj:g{zi+ij') could be 
neglected in the sum over j and j' . 
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indicating that the cross-terms do not cancel, but sum to effectively random values that 
have no average effect on the minimization of the partial sums over synchronized units. 

B. The CNN with unrestricted phases 

The second term in (jl]), restricting the phase of each oscillator to one of n values, can 
be dropped, but the temperature must be lowered even more slowly to attain a globally 
optimal pattern. Adding noise at every time step, at a level that is reduced by the factor 
a = 1 — 1/10'' after each time step, we obtained results for a single long run that are displayed 
in Table [IVl The shortest-distance tour was indeed selected, but the global optimality of 
such states for the general problem remains to be proved. 

V. CONCLUSIONS AND PROPOSED EXTENSIONS 

Synchronous firing of neurons, sometimes widely separated in the brain, is a ubiquitous 
phenomenon that has been advanced as an explanation for perceptual grouping^»^»^, mo- 
tor control^ii^, and consciousnes3^ii2i>22. Information in the brain may generally be coded 
in synchronization patterns. The dynamical evolution of such patterns would then define 




-1 r i 
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a) time(n) b) 1 

FIG. 4: Network history (a) for a run of the cellular neural network for 7 cities plotted as in Fig. [3l 
Network parameters are as in the 5-city case, but with C = D = 16.0 and annealing schedule given 
by a = 0.99999995, r = 0.02 = 2At. In two out of three runs with the city map (b), the network 
converged to tour states: the optimal tour ADCEGBF (solid) and a suboptimal tour AFCBGED 
(dashed). 
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FIG. 5: Network history (a) for a run of the cehular neural network for 10 cities plotted as in Fig. [3l 
Network parameters are as in the 5-city case, with with a "slow annealing" schedule (a = 0.9999998, 
T = 0.02 = 2 At). With cities placed on a circle, with uniform spacing, in arbitrary order (b), the 
network converged to the optimal tour. 

mental processing. One may inquire as to the utility of synchronization patterns for repre- 
sentation of objective states. 

The main result of this paper is that the evolution of synchronization patterns can be 
organized so as to satisfy a complex optimization principle. Indeed, the 5-city architecture 
is effective in selecting 2x5! = 240 shortest-distance tours out of 5^^ possible states. The 
original Hopfield architecture, by contrast, would select 2 x 5 = 10 binary- valued tour 
states out of 2^^ possibilities. The task of the synchronization network is more difficult by 
a factor of ^ (f)^^ ~ '^■'^ ^ ^^^^ ^ factor which grows exponentially with the number of 
cities. The landscape searched by the synchronization network is correspondingly rougher, 
requiring shorter time steps in a digital implementation. A large research effort with the 
original Hopfield architecture resulted in effective performance only for relatively small n, 
up to about 100. Preliminary indications are that the synchronization network will be 
competitive for simple configurations of cities and may reach at least the original Hopfield- 
Tank- level (n = 30) for arbitrary configurations, with improved numerics or in an analogue 
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TABLE III: Tour lengths for the twelve tour classes in the 5-city problem, with the number of 
times each class occurs as the final state in 20 runs of the CNN architecture using a fast annealing 
schedule (a = 0.999999, r = 0.05 = 5 At), in 20 runs with the same fast annealing schedule 
but with the weight E of the distance-minimizing term increased fourfold, and in 20 runs with a 
slow annealing schedule (a = 0.9999998, r = 0.02 = 2At). Other parameters are as in Table [H 
Synchronization, as used to define tour patterns, is deemed to occur when the relative phases of 
two or more units differ by less than 0.6. 



final state 
tour class 


tour length 


number of occurrences 


fast annealing 


slow annealing 


E = 0.4 


E = 1.6 




ACBED 
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2.349 
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ABDCE 


2.388 








non-tour 




6 


14 


6 



implementation. The demonstrated skill is surprising in view of the increased difficulty of 
the task in the new representation. Further, the network's sensitivity to the complexity of 
the map is suggestive of a human's response to the same problems. Thus the results support 
synchronization-based theories of biological information processing. 

Freemanf2ii22 has suggested a fundamental role for chaos in intelligent information pro- 
cessing. (See also Nara^.) In Freeman's view, chaos provides a combination of sensitivity 
and stability that might account for the observed range of mental competence in the face 
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TABLE IV: Relative phases as in Table HI but for a network with B = in which the phases are 
unrestricted, and with very slow annealing, given by a = 0.9999999 and r = 0.01 = At in ([7|). The 
other CNN coefficients are A = 0.5, C = D = 20.0, and E = 0.12. After a very large number of 
iterations, the network converged to the shortest tour. 
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2.316 


-2.583 


-1.209 


-0.151 


1.289 
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-1.607 


0.129 


1.237 


2.335 


-2.286 


city C 


-2.878 


-1.291 


0.321 


1.227 


2.372 


D 


0.929 


2.600 


3.032 


-1.482 


0.461 


E 


0.392 


1.077 


2.344 


-2.610 


-1.388 



of unique inputs. The known highly intermittent synchronization of 40Hz oscillations was 
taken to support the suggested role for chaos. Information is thought to be coded in spatial 
patterns of coherent activity that exist for very brief periods of time, typically less than 200 
milliseconds. 

Synchronization networks of the type described in this paper can be naturally extended 
to a chaotic framework. If the limit cycle oscillators were to be replaced by chaotic oscil- 
lators, chaos might replace noise in the simulated annealing scheme. (The "chaotic oscil- 
lators" might be higher-level constructs formed from groups of units with incommensurate 
frequencies.) Synchronization of chaotic oscillators typically degrades via on-off intermit- 
tency, wherein the synchronized phase is interrupted at irregular intervals by bursts of 
de-synchronizatiori2ii2^. A stable synchronization manifold, like other invariant manifolds, 
typically contains unstable periodic orbits (UPOs) embedded within strange attractors. Tra- 
jectories that approach such orbits while still off-manifold commonly burst away. That be- 
havior would resemble the noise-induced jumping among different synchronization patterns 
in the network of regular oscillators studied here. 

Bursts away from local optima might play the role of a "reset" mechanism such as the 
one introduced in Adaptive Resonance Theory^ for old-style networks. Unlike the situation 
with associative memory networks, continuous reset is here desirable. The dynamical regime 
of nearly continuous bursting^I might be identified with the biological case, in which the syn- 
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chronized phase is very short-lived. Slow Hebbian learning, to adjust connection strengths, 
might be used for "permanent" global optimization, if desired, replacing the gradual low- 
ering of temperature in the simulated annealing scheme. Such chaotic architectures will be 
the subject of future work. The simplified limit cycle version confirms the representational 
utility of synchronized oscillator networks. 
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APPENDIX A: Proof of the optimality of shortest-distance tours for the n-phase 
synchronization network 

Without loss of generaUty, consider the TSP for a table of distances expressed in units 
such that dij < 1 for all i,j. The Lyapunov fuction, previously expressed as (jlj) is: 
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I Zij I 
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-^EE 
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The term 



Zij ^i+lj' 



(8) 



(9) 



\Zij\ \Zi-\-\j' 

is intended to minimize total tour length. Consider strict tour configurations for which 
each Zij is one of the nth roots of unity, i.e. zij = exp{27Timij/n) for some integer rriij, 
< rriij < n, and the integers rriij are all different within each column and within each row. 
That is, we allow rriij = rniiji only ii i = i' and j = j' or if i 7^ i' and j 7^ j'. Among such 
strict tour configurations, let m?- be chosen so as to minimize T>. 

In a stationary phase approximation, one could relate D to a sum T> without the cross 
terms: 

~* 

Zij ^i+lj' 



V 



E E 



djjiRe 



Zij\ 



We do not require equality or proportionality of V and f), but make a weaker assumption 
that tour configurations that minimize T) also minimize P, i.e. 

Zij 



< E E ^n'^^ 

{j3'\'m-i+i,j'=™-ij} 



\Zij I l^j+l, 



(10) 



for any tour configuration given by {zij = exp(27rzmjj/n)} that is not necessarily shortest- 
distance. We regard (fTOll as empirically validated. 

We can then show that the Lyapunov function ([8]) is appropriate for the TSP by proving 
the following theorem. 
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Theorem: There are values of the coefficients A, 5, C, and in the Lyapunov function 
L : C"^ TZ that is defined in ([8]), such that the global minima of L occur at states 
{zij : i,j = l...n} that correspond to shortest-distance tours, and only occur at such 
states, provided that the assumption flTUl) holds. The required coefficients can be chosen 
universally for the n-city problem, except for dependence on the difference in the value of 
V (as defined in (Q) between the shortest and the second-shortest tour configurations. 

Proof: 

The ffist term in ([8]) is a function only of the magnitudes \zij\, while the remaining 
terms are functions only of the phases that define the normalized values Zij/\zij\. Thus 
the ffist term, which is non-negative, can be minimized separately, by choosing all \zij\ = 1, 
so that the term vanishes. 

Note that the third, fourth, and ffith terms satisfy the respective bounds: 
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(12) 
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(13) 



We ffist show that each normalized value Zij/\zij\ can be made to differ from one of the 



nth roots of unity by an arbitrarily small amount Cjj when L is minimized. Define the 
deviations eij by 



2n — 
ij ~ 



(14) 



for a set of values {%} that minimizes L. If the remaining coefficients C = D = E = 0, 
the minima of L occur when each eij = 0. For nonvanishing C,D, and/or E, no summand 
in the term with coefficient B can raise the minimum value by an amount greater than the 
range of values of the remaining three terms, so Befj' < An^C + An^D + 2n^E, or 



e,'" < {An^C + An-^D + 2n'^E)/B 



(15) 



Thus, by choosing B sufficiently large as compared to C,D, and E, each eij can be made as 
small as desired. 
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Choose B so that each normahzed value Zij/\zij\ is much closer to one particular nth root 
than to any other, i.e. eij << |1 — exp(27rz/n)|, as will follow from f|T5l) if 

{An^C + An^D + 2n^E)/B « \1 - exp{2m/n)\'^'' (16) 

For any given set of deviations {Uj}i<i,j<n, one is free to choose one of the n roots arbitrarily 
for each member of the set. It is easily seen that the terms with coefficients C and D are 
minimized when the n values in each row and each column, respectively, are spaced as 
widely as possible about the unit circle, while the term with coefficient B is invariant. Thus 
if E = 0, the minima of L occur when the phases that are approximately nth roots of unity 
are all different in each row and each column, i.e. for configurations that define n "tours" . 

If E is sufficiently small, then the minima of L still occur at tour configurations. Specif- 
ically, let A be the minimum difference between the value of a sum over a single row or 
column (X^ii' oi' &) phases different and the sum for the case of one phase 

duplication. One finds 

A = |1 -exp(27ri/n)|2 (17) 
Then, using f|T3|) . the minima of L occur at tours if 

n^E <CA + 0{e^) n^E <DA + 0{e^) (18) 

where e = maxjjjejj}. 

For strict tour configurations (with all e^j = 0), the last term in ([8]), EV, is separately 
minimized for shortest tours, since by assumption ( fTOl) . EV is minimized for values Zij = 
exp(27rim^j/n) that minimize ET>. The latter can be written as a sum over n tours, n = 
1, . . . , n, each tour defined by a set of n units with equal phases: 

n 

E:D = Ej]d,^(,),,4,+i) (19) 

K=l 

where k labels the tour and jnii) is the city visited at time i while on tour k. This sum is 
minimized if and only if all the are shortest-distance tours. 
For non-strict tour configurations with eij ^ 

P = ^ ^ ^ rfj-jvRe (e2-^'^'./"e-2-i"^>+i,//") + 0{t^) (20) 
i j j' 
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for some {friij}. This sum is minimized for shortest-distance tours provided that the differ- 
ence 6 = \V{{'m9j}) — V{{mlj})\ in the value of V between the shortest tour configuration 
{m° } and the second-shortest tour configuration {mjj} is not too small, i.e. 

6 » 0{e^) (21) 

which is satisfied, according to the bound (fTSi) . if 

{An^C + An^D + 2n^E) /B « 5" (22) 

For given 5, we note that the coefficients B,C,D, and E can always be chosen so as to 
satisfy ffTBl) . ffTS]) . and fl22]) . completing the proof. 
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APPENDIX B: Statistical significance of the results in Table HII 

The correlation between tour-length di and number of occurrences n^, as listed in the 
second and third columns of Table UTTl respectively, for the case with fast annealing and E = 
0.4, is r = Yli=i^i''^i/i'^d'^n) = —0.62. If the data were truly nncorrelated, the correlations 
would be t-distributed about r=0. In such a distribution with 12 — 2 = 10 degrees of 
freedom,, r = —0.62 corresponds to a t-score of t = 2.50, and the probability of r < —0.62 is 
found to be less than 5%, using a t-distribution table or online calculator—. So the negative 
correlation is significant at the 95% level. 

The significance of the rate of occurrence of the optimal tour, ACBED, among all tours 
found, is calculated as follows: In the slow annealing case, ACBED occurs in 4 instances 
out of the 14 tour states chosen. If all 12 possible tour states were selected with equal 
probability, the probability of 4 or more occurrences of the shortest-distance tour would be: 




= 0.024 



implying significance at the 97% level. 

Significance levels of the correlations and of the shortest-distance tour selection rates are 
computed similarly for the other 5-city networks, with results as shown in Table |Vl (The 
fast-annealing scheme with E = 0.4 did not select the optimal tour at a significant rate.) 

The trends in Table IIIII as E is increased and as annealing is slowed lend additional 
significance to the results. 

TABLE V: Correlation r between length of a tour and frequency of selection of the tour for the 
three network configurations represented in Table IIIH with the t-score of each correlation, the 
significance level, and the significance level (if any) of the selection of the shortest-distance tour. 





fast annealing 


slow annealing 




E = 0.4 


E = 1.6 




r 


-0.62 


-0.59 


-0.87 


t-score 


2.50 


2.30 


5.58 


significance level of distance/frequency correlation 


95% 


95% 


99.9% 


significance level of shortest-distance tour selection 




99.9% 


97% 
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